Multi-modal human-machine communication for instructing robot grasping tasks
نویسندگان
چکیده
A major challenge for the realization of intelligent robots is to supply them with cognitive abilities in order to allow ordinary users to program them easily and intuitively. One way of such programming is teaching work tasks by interactive demonstration. To make this effective and convenient for the user, the machine must be capable to establish a common focus of attention and be able to use and integrate spoken instructions, visual perceptions, and non-verbal clues like gestural commands. We report progress in building a hybrid architecture that combines statistical methods, neural networks, and finite state machines into an integrated system for instructing grasping tasks by man-machine interaction. The system combines the GRAVIS-robot for visual attention and gestural instruction with an intelligent interface for speech recognition and linguistic interpretation, and an modality fusion module to allow multi-modal task-oriented man-machine communication with respect to dextrous robot manipulation of objects.
منابع مشابه
Learning issues in a multi-modal robot-instruction scenario
One of the challenges for the realization of future intelligent robots is to design architectures which make user instruction of work tasks by interactive demonstration effective and convenient. A key prerequisite for enhancement of robot learning beyond the level of low-level skill acquisition is situated multi-modal communication. Currently, most existing robot platforms still have to advance...
متن کاملMulti-Modal RGBD Sensors for Object Grasping and Manipulation
RGBD sensors, such as the Microsoft Xbox Kinect [1] are types of multi-modal perceptual sensors that have appeared in recent years. RGBD sensors have become standard perceptual tools for robots as they provide a unique multi-modal approach to perception. A vital pre-cursing challenge in object grasping and manipulation is object pose recognition. A robot must identify the pose (i.e. orientation...
متن کاملSituated robot learning for multi-modal instruction and imitation of grasping
A key prerequisite to make user instruction of work tasks by interactive demonstration effective and convenient is situated multi-modal interaction aiming at an enhancement of robot learning beyond simple low-level skill acquisition. We report the status of the Bielefeld GRAVIS-robot system that combines visual attention and gestural instruction with an intelligent interface for speech recognit...
متن کاملGuiding attention for grasping tasks by gestural instruction: the GRAVIS-robot architecture
A major goal for the realization of a new generation of intelligent robots is the capability of instructing work tasks by interactive demonstration. To make such a process efficient and convenient for the human user requires that both the robot and the user can establish and maintain a common focus of attention. We describe a hybrid architecture that combines neural networks and finite state ma...
متن کاملHierarchical control method for manipulating/grasping tasks using multi-fingered robot hand
In this paper, we propose a hierarchical control method for manipulation/grasping tasks using multi-fingered robot hands. This method imitates human motion control. Human motion can be divided into two classes, reflex and voluntary movement. Reflex is suppressed by voluntary movement. In the proposed method, a robot hand’s grasping control corresponds to human reflex and manipulation control co...
متن کامل